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Abstract 

Cellulases play a key role in enzymatic routes for degradation of plant cell-wall polysaccharides into simple and 
economically-relevant sugars. However, their low performance on complex substrates and reduced stability under 
industrial conditions remain the main obstacle for the large-scale production of cellulose-derived products and 
biofuels. Thus, in this study a novel cellulase with unusual catalytic properties from sugarcane soil metagenome 
(CelE1) was isolated and characterized. The polypeptide deduced from the celE1 gene encodes a unique glycoside 
hydrolase domain belonging to GH5 family. The recombinant enzyme was active on both carboxymethyl cellulose 
and B-glucan with an endo-acting mode according to capillary electrophoretic analysis of cleavage products. CelE1 
showed optimum hydrolytic activity at pH 7.0 and 50 °C with remarkable activity at alkaline conditions that is 
attractive for industrial applications in which conventional acidic cellulases are not suitable. Moreover, its three- 
dimensional structure was determined at 1.8 A resolution that allowed the identification of an insertion of eight 
residues in the B8-a8 loop of the catalytic domain of CelE1 , which is not conserved in its psychrophilic orthologs. This 
8-residue-long segment is a prominent and distinguishing feature of thermotolerant cellulases 5 suggesting that it 
might be involved with thermal stability. Based on its unconventional characteristics, CelE1 could be potentially 
employed in biotechnological processes that require thermotolerant and alkaline cellulases. 
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Introduction 

In the face of growing energy costs, dwindling fossil 
resources, environmental pollution and a globalized economy, 
the large-scale use of biotechnology instead of, or to 
complement, traditional industrial production processes, 
particularly in the chemical sector, is viewed as both an 
opportunity and a necessity to a more social and ecological 
sustainable energetic matrix [1]. In this context, metagenomics 
has received much attention owing to its great potential to 
provide new biocatalysts with diverse functions and 
applications. To illustrate that, a soil sample might contain in 
the order of 10 4 different bacterial species and more than one 
million novel open reading frames, many of which encode 



putative enzymes [1]. This approach has been broadly 
employed to find new enzymes to assist ethanol production 
through lignocellulosic biomass degradation thereby enabling a 
new generation of biofuels production [2]. 

The current scenario for the reduction of biomass into 
fermentable sugars via enzymatic routes is characterized by 
very scanty exemplars of glycoside hydrolases suitable for this 
process in an industrial scale and rate. Thus, the discovery of 
new enzymes with higher catalytic efficiency and stability under 
industrial conditions, and even specialized for the different 
biomass sources, such as sugar-cane bagasse, may have a 
revolutionary role in making biofuel production from plant 
biomass economically viable [3,4]. It is more pronounced when 
focused on cellulose degradation since cellulases are 
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considered the main bottleneck in biomass breakdown, 
principally due to very low catalytic efficiency. Moreover, 
cellulose is the major polysaccharide found in vegetal biomass 
and plants produce about 180 billion tons of cellulose per year 
globally, making this carbohydrate the largest organic carbon 
reservoir on earth [5]. Consequently, efficient breakdown of 
cellulose is an essential pre-requisite for the production of 
biofuels and cellulases are key enzymes in this process [6]. 
The catalytic domains of cellulases are found in 14 families of 
glycoside hydrolases that have been classified according to 
their sequence [7]. Cellulases or endoglucanases (EC 3.2.1.4) 
catalyze the cleavage of random internal (3-1 ,4-glycosidic 
bonds in cellulosic chains and can be found coupled to the 
cellulosome or generally secreted as independent enzymes. 
Very often, the catalytic domain of cellulases is associated to 
one or more carbohydrate-binding modules, which binds to the 
substrate and increase the catalytic efficiency of some 
enzymes. 

Here, we report a novel cellulase belonging to GH5 family, 
named as CelE1, retrieved from a sugarcane soil metagenomic 
library, which is a promising biocatalyst in biofuels production. 
The thorough biochemical and structural characterization, 
providing details about three-dimensional structure, catalytic 
properties and stability of CelE1, might contribute to broaden 
our understanding of the molecular basis for glycoside 
hydrolases adaptation to use surgacane biomass as substrate 
and then serve as an instrumental model for enzyme redesign 
and optimization aiming at vegetal biomass degradation. 

Materials and Methods 

Identification, cloning and sequencing of the CelE1 
gene 

To isolate cellulolytic clones, a functional screening of a 
metagenomic library derived from sugarcane field land soil was 
performed [8]. Briefly, recombinant E. coli cells were spread on 
plates containing 0.5% (w/v) carboxymethyl cellulose (CMC) as 
substrate and colonies producing clear hydrolytic halos were 
selected by staining with Congo red [9]. 

The plasmid was extracted from the positive clone and 
sequenced using universal M13 forward and reverse primers 
on an ABI Prism 377 Genetic Analyzer (Applied Biosystems, 
USA) at the Brazilian Bioethanol Science and Technology 
Laboratory. The sequence was analyzed using Geneious Pro 
4.8.5 and the identified ORF was compared to public 
databases available at NCBI by BLASTx tool. The nucleotide 
sequence of CelE1 gene was deposited in the GenBank 
database under the accession number KF498957. 

Protein expression and purification of the catalytic 
domain 

The coding region corresponding to the GH5 catalytic 
domain of CelE1 was amplified by PCR using primers 
containing A/del (5'- 

TATATAT CATATG GTCGCACCCATTACTACCAGC-3') and 
SamHI (5 '-ATA G GATCC TTACGGC CAAC C G G AAATAAT-3 ' ) 
restriction sites (underlined) for cloning into the plasmid 
expression vector pET-28a(+) (Novagen). 



The plasmid pET28a harboring the encoding sequence of 
celE1 was transformed in E. coli Rosetta-gami™(DE3)pLys 
cells and the protein expression performed under standard 
conditions at 37 °C for 4 hours in selective LB medium 
(kanamycin) containing 0.5 mM IPTG (isopropyl B-D- 
thiogalactopyranoside). The cells were harvested, resuspended 
in lysis buffer (20 mM sodium phosphate pH 7.4, 100 mM 
NaCI, 5 mM benzamidine, 1 mM PMSF) and then sonicated 
with 6 pulses of 30 s at 500 W, using a VC750 Ultrasonic 
Processor (Sonics Vibracell). The lysate was clarified (20,000 
xg for 30 min) and the supernatant was loaded onto a nickel- 
affinity column (GE Healthcare) which was washed and the 
recombinant protein eluted using a non-linear gradient of 
imidazole (0 to 500 mM). The fractions were then pooled, 
concentrated and subsequently applied to a size-exclusion 
chromatography column (Superdex 75, GE Healthcare), pre- 
equilibrated with 50 mM sodium phosphate buffer pH 7.4, 150 
mM NaCI. The sample purity was confirmed by polyacrylamide 
gel electrophoresis under denaturing conditions [10]. Protein 
concentration was estimated by absorbance at 280 nm using 
molar extinction coefficient for the polypeptide deduced from 
gene ( http://web.expasy.org/protparam/ ). 

Enzyme characterization and cellulase activity 

The hydrolytic activity was determined by quantifying the 
amount of reducing sugar released from different 
polysaccharides using the 3,5-dinitrosalicylic acid method [11]. 
One unit (U) was defined as the quantity of enzyme required to 
release 1 umol of reducing sugar per min. To determine the 
optimum pH, the enzymatic reactions were carried out in 200 
mM phosphate-citric acid-glycine buffer (pH 2-12) containing 
0.5% (w/v) CMC and, incubated at 37 °C during 20 min. For 
optimal temperature determination, the reactions were in the 
same buffer (pH 7.0) but incubations were in the range of 15 - 
90 °C. 

Experimental conditions (reaction time and enzymatic units) 
were adjusted to guarantee the estimation of initial velocities 
(hydrolysis of no more than 5% initial concentration of 
substrate to obtain linear response of product formation in 
respect to reaction time) for the determination of kinetic 
parameters K m , V max and /c cat . Assays were conducted in 50 mM 
phosphate buffer pH 7.0 at 50 °C during 15 min using 
0.06-5.4% (w/v) CMC as substrate. Due to the natural 
heterogeneity of the substrate, the apparent affinity constant K m 
was expressed as mg.ml 1 . Mathematical adjustments were 
made using the software Graph Pad Prism 5.0 (GraphPad 
Software) to calculate parameters. The assays were performed 
in quintuplicate. 

The substrate specificity was evaluated against a set of 
natural polysaccharides at 37 °C using Mcllvaine buffer (pH 6) 
during 30 min [12]. Hydrolytic activity over industrially-relevant 
insoluble substrates including pretreated sugarcane bagasse 
(BEX) (a generous gift from Prof. Dr. George Jackson) and 
Avicel PH-101 (from Sigma USA) was also tested. In these 
experiments, a reaction containing 1 .0% of each substrate in 
200 mM sodium phosphate buffer pH 7.5 were incubated with 
10 ug of recombinant protein at 50 °C during 200 min and 40 
°C during 24 h, under constant agitation (1,000 rpm). The 
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released sugars were separated from residual polysaccharide 
by centrifugation 10,000 xg for 20 min and quantified onto 
supernatant. 

Capillary electrophoresis of oligosaccharides 

Cellohexaose (from Megazyme) and oligosaccharides 
released by enzymatic activity were derivatized with 8- 
aminopyrene-1 ,3,6-trisulfonic acid (APTS) by reductive 
amination. Enzymatic reactions were performed as described 
previously, using cellohexaose a substrate [13]. Capillary zone 
electrophoresis of oligosaccharides was performed on a P/ACE 
MQD instrument (Beckman Coulter) equipped with laser- 
induced fluorescence detection. A fused-silica capillary 
(TSP050375, Polymicro Technologies) of internal diameter of 
50 pm and total length of 31 cm was used as separation 
column for oligosaccharides. Electrophoresis conditions were 
15 kV/70-100 pA at a controlled temperature of 20 °C. 
Oligomers labeled with APTS were excited at 488 nm and 
emission was collected through a 520 nm band pass filter. 

Circular dichroism spectroscopy and thermal unfolding 
studies 

Far-UV circular dichroism (CD) spectra of CelE1 (20 mM 
phosphate buffer, pH 7.4 at 25 °C) was measured in the range 
195-260 nm in a Jasco J-810 spectropolarimeter (Jasco 
International Co.) coupled to a Peltier temperature controller 
using a 1 mm quartz cuvette. Protein concentration was set to 
10 pM and the results were expressed as mean residue 
ellipticity (deg. cm 2 . dmor 1 . residue 1 ). A total of 30 spectra were 
collected, averaged and corrected by subtraction of the blank. 
In order to investigate the thermal stability, CD spectra were 
analyzed at different temperatures ranging from 20 to 100 °C. 
Thermal unfolding was monitored by CD intensity changes at 
222 nm. Data deconvolution for prediction of secondary 
structure was performed in DichroWeb server [14,15]. 

Crystallization 

Crystallization experiments were performed by the sitting- 
drop vapor-diffusion method using a Cartesian HoneyBee 963 
system (Genomic Solutions) and a protein concentration of 
17.5 mg.ml" 1 . 544 different formulations based on commercial 
crystallization screens were tested including those from 
Hampton Research (SaltRX, Crystal Screen and Crystal 
Screen 2), Emerald BioSystems (Precipitant Synergy and 
Wizard I and II) and Qiagen/NeXtal (PACT and JCSG+). Sitting 
drops were prepared by mixing 0.5 pi of the protein solution 
with an equal volume of mother liquor and equilibrated against 
80 pi of the reservoir solution at 18 °C. Small and clustered 
crystals were obtained from the condition consisting of 35% 
(v/v) isopropanol, 30% (w/v) PEG3350 and 100 mM Tris-HCI 
pH 8.5. The condition was refined varying the isopropanol 
concentration and by adding glycerol to prevent crystal 
clustering. Best crystals were grown in 7 days from the 
condition 30% (v/v) isopropanol, 30% (w/v) PEG3350, 100 mM 
Tris-HCI pH 8.5 and 5% (v/v) glycerol as additive. 



Table 1. Crystal data and refinement statistics for CelE1. 





Parameter 


PDB code 


4M1R 


Data collection 


Space group 


P12-,! 


Cell dimensions 


a, b, c (A) 


41.88, 87.51, 66.63 


a, P,vC) 


90, 98.55, 90 


Resolution (A) 


50.00- 1.80(1.86-1.80) 


Emerge 


9.6 (30.2) 


</ / a/> 


16.9 (3.9) 


Completeness (%) 


93.5 (80.4) 


Multiplicity 


6.5 (3.8) 


Refinement 


Rwork / Rfree (%) 


13.33/ 17.34 


Protein molecules in ASU 


2 


Ligand/ion 


2 


Water molecules 


379 


Mean S-factor (A 2 ) 


12.9 


R.m.s. deviations 


Bond lengths (A) 


0.024 


Bond angles (°) 


1.676 


Ramachandran Plot 


Favored (%) 


96.3 


Allowed (%) 


3.7 



Values in parentheses are for highest - resolution shell, 
doi: 10.1371/journal.pone.0083635.t001 



Data collection and processing 

X-ray diffraction data were collected on W01B-MX2 beamline 
at the Brazilian Synchrotron Light Laboratory (Campinas, 
Brazil). The wavelength was set to 1 .459 A and the intensities 
were recorded in a Mar Mosaic 225 mm charged-coupled 
device detector. The complete data set were collected to a 
maximum resolution of 1.78 A. Data were indexed, integrated, 
merged and scaled using the HKL2000 package (Otwinowski 
and Minor, 1997). The reflections were indexed in the 
monoclinic crystal system with unit-cell parameters a= 41.88, 
b=87.51 c=66.63 A. An examination of the systematic 
absences indicated that the crystal belonged to space group 
P2 : . Calculation of the Matthews coefficient [16] based on the 
molecular weight of 32,186 Da (monomer) resulted in a V M of 
1.88 A 3 .Da 1 and a solvent content of 34.48%, which 
corresponds to the presence of two molecules per asymmetric 
unit. The statistics of the data processing are summarized in 
Table 1. 

Structure determination and refinement 

The structure of CelE1 was solved by molecular replacement 
using the program MOLREP [17] and a model based on the 
atomic coordinates of a psychrophilic cellulase from 
Pseudoalterromonas halaplanktis (PDB code 1TVP, [18]), 
which displays 67% sequence identity with CelE1. Two clear 
solutions were observed, which confirmed the presence of two 
molecules in the asymmetric unit and the initial model after 
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Figure 1. Effect of pH and temperature on the hydrolytic activity of CelE1. Enzyme was incubated in 200 mM phosphate-citric 
acid-glycine buffer containing 0.5% (w/v) of CMC as substrate for 20 min and the amount of reducing sugars measured by the 3,5- 
dinitrosalicylic acid method. (A) Measurements were carried out in pH values ranging from 2 to 12 by incubation at 37 °C. (B) 
Activity assayed under different temperatures at pH 7. Assays were performed on quadruplicate aliquots. Each experiment was 
repeated three times. 

doi: 10.1371/joumal.pone.0083635.g001 



rigid-body refinement resulted in Rf ac t or /Rfree of 44/48%. 
Structure refinement was carried out with the program 
REFMAC5 [19]. After each cycle of refinement, the model was 
inspected and manually adjusted into the (2F 0 -F C ) and (F 0 -F 0 ) 
electron density maps using the program COOT [20]. In the 
later cycles, water molecules and ligands were added manually 
and refined. The atomic coordinates and structure factors of 
CelE1 have been deposited in the Protein Data Bank under the 
accession code 4M1 R. 

Results and Discussion 

Isolation and identification of a novel cellulolytic gene 
from sugarcane soil metagenome 

In order to obtain new cellulolytic enzymes, functional 
screening of a metagenomic library using CMC as substrate 
was carried out. One clone of 3100 bp displaying cellulase 
activity was isolated and sequenced which included the 
complete CelE1 -encoding open reading frame. CelE1 is 1284 
bp long and encodes a polypeptide of 428 amino acid residues 
containing a putative N-terminal signal peptide followed by a 
glycoside hydrolase domain belonging to family 5 (GH5). 

The polypeptide corresponding to the GH5 domain of the 
enzyme shares high sequence identity to previously 
characterized glucanases including the endoglucanase from 
Cellvibrio japonicus Ueda107 (80% identity, GenBank 
accession number YP_001983438.1), a cellulase isolated from 
a bacterium enrichment culture (75% identity, GenBank 
accession number ACR23656.1) and an endoglucanase from 
an uncultured organism (74% identity, GenBank accession 
number ACY24859.1). These analyses confirm that CelE1 
belongs to the family 5 of glycoside hydrolases. 



CelE1 displays unusual pH dependence for catalysis 

To obtain the catalytic domain and advance with the 
biochemical and structural characterization, the coding region 
of CelE1 was sub-cloned into pET28a(+) vector, produced in E. 
coll cells and purified to homogeneity. Recombinant CelE1 was 
active (relative activity > 30%) over a broad pH range from 5 to 
10 with maximum enzyme activity at neutral pH (Figure 1A). In 
addition, the enzyme retains 70% of the activity in pH values of 
8 and 9. Usually cellulases belonging to GH5 family display an 
optimum for activity at slightly acid pH as observed for the 
hyperthermophilic CelB from Caldicellulosiruptor 
saccharolyticus (pH 5.5) [21], Cel5G derived from a soil 
metagenomic library (pH 4.8) [22], BsCel5A from Bacillus 
subtllis (pH 6) [23] and cellulase 5 from the buffalo rumen 
metagenomic library (pH 5.5) [24]. However, this novel 
cellulase exhibited a bell-shaped curve for pH dependence, 
shifted towards basic conditions, which can be attractive for 
biotechnological applications in which conventional acidic 
cellulases are not suitable. 

Furthermore, the effect of temperature on activity of CelE1 
was examined by monitoring the hydrolysis of CMC in the 
range from 10 to 90 °C at pH 7 (Figure 1B). The enzyme 
activity increased in a temperature-dependent manner 
reaching a maximum at 50 °C as described for other 
characterized cellulases [23-26]. Interestingly, the enzyme 
showed more than 60% of the relative activity at temperatures 
around to 70 °C and higher levels even at low temperatures (10 
- 50 °C) indicating that CelE1 could be considered as a 
thermotolerant enzyme with significant catalytic activity ( > 
65%) at a broad temperature range from 10 to 70 °C. 

The apparent kinetic parameters were determined in the 
presence of different concentrations of CMC resulting in K m and 
values of 6.05 ± 0.37 mg.ml 1 and 24.54 s 1 , respectively. 
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Figure 2. Cleavage pattern of CelE1 on different cellooligosaccharides (cellotetraose (C4), cellopentaose (C5) or 
cellohexaose (C6)) indicating a classical endo-acting mode. (A) Capillary-zone-electropherogram of the APTS-labeled- 
cellohexaose hydrolysis (substrate). (B) Analysis of APTS-labeled products of C4, C5 and C6 hydrolysis. 

doi: 10.1371/journal.pone.0083635.g002 



Based on these values the catalytic efficiency was 4.06 mg.ml 
~ 1 s~ 1 (/f cat /K m ) that is similar to that observed for other glycoside 
hydrolases such as Thermoanaerobacter tengcongensis 
cellulase [27], cellulase 5A from Bacillus subtilis [23] and the 
cellulase 5A from Clostridium thermocellum [28]. Altogether, 
the catalytic efficiency of CelE1 combined with the ability to be 
active in alkaline pHs and in a broad range of temperature, 
suggest a potential application of this enzyme for industrial 
processes involving cellulose degradation. 

CelE1 is an endo-acting cellulase with high activity on 
complex and industrially-relevant polysaccharides 

In order to define the action mode for CelE1, capillary zone 
electrophoresis experiments were carried out with 
cellohexaose (Figure 2). The cleavage pattern with the 
predominance of cellobiose, cellotriose and cellotetraose as 
final products in an oligosaccharide-length manner indicates 
that the enzyme most likely attacks the internal glycosidic 
linkages. It confirms that CelE1 is an endoglucanase with 
action mode similar to the cellulase GH5 from Erwinia 
chrysanthemi [29]. 

Moreover, the substrate specificity of CelE1 was assessed 
by measuring the hydrolytic activity against eight 



carbohydrates. Table 2 shows that CelE1 was able to degrade 
efficiently CMC and B-glucan, the latter being most efficiently 
hydrolyzed. To further examine the ability of this cellulase to 
breakdown insoluble complex carbohydrates with industrial 
interest, hydrolytic activity was investigated using Avicel and 
BEX as substrates. The rate of sugar released using Avicel 
was 2- and 8-fold higher than obtained for cellulases Cel5A 
from Gloeophyllum trabeum and endoglucanase CgEG1 from 
Coptotermes gestroi, which were 4.5 x 10 3 and 1.1 x 10" 3 
U.mg 1 , respectively [30,31]. The glycoside hydrolase CelE1 
was also active on BEX (Table 2). These findings make CelE1 
a promising candidate with potential applicability in industrial 
processes involving the deconstruction of plant biomass. 

Structural stability of CelE1 

CD analysis of CelE1 resulted in a typical far-UV spectrum of 
a/B proteins as expected for a GH5 family member indicating a 
native-like structure of the recombinant enzyme (Figure 3A). In 
order to investigate its thermal stability, unfolding experiments 
were performed (Figure 3B). The enzyme presented a melting 
temperature (T M ) of 55 °C, which is in agreement with the 
temperature for maximum catalytic activity (Figure 1B). 
Interestingly, CelE1 showed significant higher thermal stability 
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Figure 3. Biophysical characterization of CelE1. (A) Far-UV CD spectrum of CelE1 with typical profile of a/B proteins. (B) 
Thermal denaturation profile characterized by a single transition and a melting temperature of 55 °C. 

doi: 10.1371/journal.pone.0083635.g003 



Table 2. Substrate specificity of CelE1 . 





Substrate 


Specific 


activity 


(U/mg) 


Simple carbohydrates 


CMC 


13.1 


± 


1.2 


3-glucan 


38.8 


± 


4.4 


Galactomannan 


0.0 


± 


0.0 


Xyloglucan 


0.0 


± 


0.0 


Xylan from Beechwood 


0.0 


± 


0.0 


Rye Arabinoxylan 


0.0 


± 


0.0 


Sigmacell cellulose (type 20) 


2.0 


± 


0.6 


Pectin 


0.4 


± 


0.02 


Complex carbohydrates 


Avicel 


(8.8 


± 


0.3) x 10" 3 


BEX 


(10.2 


± 


0.1) x 10" 3 



doi: 10.1371/joumal.pone.0083635.t002 



than its psychrophilic orthologs such as Cel5G from 
Pseudoalteromonas haloplanktis (T M = 43 °C), despite high 
sequence identity (67%) [32]. Thus, high-resolution structural 
data combined with extensive comparative analysis could 
provide insights into the molecular basis for structural stability 
differentiation among highly similar cellulases 5. 

Crystallographic depiction of CelE1 at high resolution 

The structure of this novel cellulase 5 was determined by X- 
ray crystallography at 1.8 A resolution. The crystals belonged 
to the monoclinic space group P2 : with two molecules in the 
asymmetric unit (Table 1), which could suggest a quaternary 
arrangement. However, analysis of crystalline contacts using 
the PDBePISA server [33] did not indicate the presence of any 
stable protein-protein interface discarding the existence of 
CelE1 as a dimer in solution. The monomeric form was also 
confirmed by dynamic light scattering (DLS) measurements 



(Figure S1 in File S1) and analytical size-exclusion 
chromatography (Figure S2 in File S1). DLS measurements 
resulted in a hydrodynamic radius of 2.8 nm (20.6% 
polydispersity) that corresponds to the monomer of CelE1 . 

The final model converged to a R work /R free of 13.33/17.34% 
with excellent global and local stereochemical properties (Table 
1 and Table S1 in File S1). The enzyme displays a canonical 
(B/a) 8 -barrel fold (also known as TIM-barrel) characteristic for 
members of GH5 family, which is composed by eight B-sheets 
surrounded by eight a-helices (Figure 4A). This structural 
scaffold is extremely versatile, being able to harbor a number 
of activities related or not to glycoside hydrolases [34]. The 
long loops connecting the C-terminal end of B-strands with the 
N-termini of the external layer of helices delineate the 
negatively charged catalytic cleft (Figure 4B). The opposite 
side of the active site contains an extra anti-parallel B-sheet, 
commonly present in cellulases 5 that contributes to 
conformational stability [23,35-38]. 

By comparison with other cellulases the residues Glu 152 and 
Glu 240 were identified as the catalytic acidic residues, where 
Glu 152 is the proton donor and Glu 240 the nucleophile. These 
residues are separated by a distance of 6.1 A, compatible for a 
retaining catalytic mechanism (Figure 4C). Other important 
residues for catalysis are conserved including His 117 and His 212 
[25,35,39,40] and the aromatic gatekeepers Trp 189 and Trp 241 
(Figure 4C). These tryptophan residues play a key role in sugar 
binding through aromatic stacking interactions with the 
glucopyranosyl rings [41]. 

The extended B8-a8 loop generates additional 
intramolecular contacts that might contribute to CelE1 
thermal stability 

CelE1 is structurally similar to other characterized GH5 
cellulases with RMSD values for Ca atoms of 0.39, 0.43, 0.56 
and 0.64 A in relation to cellulases from Erwinia chrysanthemi 
[35], Pseudoalteromonas haloplanktis [18], Bacillus 
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Figure 4. Structural studies of CelE1. (A) Overall structure of the CelE1 showing a classical (B/a) 8 -barrel fold with the two 
catalytic acidic residues depicted. (B) Surface charge distribution with highlight to highly negatively charged active-site pocket that is 
essential for substrate binding and cleavage. (C) Details of the active site in which catalytically-relevant residues are indicated. 

doi: 10.1371/journal.pone.0083635.g004 



agaradhaerens [42] and Bacillus subtillis [23], respectively. 
Despite the fully conserved catalytic cleft, some significant 
structural differences were found in the interfacial loops that 
vary in length and composition (Figure 5A and Figure S3 in File 
S1). In contrast to the cellulase from Erwlnia chrysantheml 
(EcCel5, PDB code 1EGZ) and its psychrophilic ortholog 
Cel5G from Pseudoalteromonas haloplanktis (PDB code 
1TVN), CelE1 contains an insertion of eight residues in the (38- 
a8 loop ( 292 ANGGWTSS 299 ). This extended loop promotes 
additional intramolecular contacts that might be related to the 
significant higher structural stability of CelE1 in comparison to 
the Cel5G from Pseudoalteromonas haloplanktis and other 
psychrophilic orthologs (Figure 5B). Moreover, other cellulases 
5 with similar thermal stability as those isolated from 6. subtilis 
(BsCel5A, Tm ~60°C) [23] and B. agaradhaerens (BaCel5, 
PDB code 1QHZ) [42] also have an equivalent insertion in the 
68-a8 loop, supporting our finding (Figure 5B). The comparison 
with both BsCel5A and BaCel5 reveals other small insertions or 
deletions in interfacial loops (Figure S3 in File S1), but these 
regions are not implicated in generating new intramolecular 
contacts as those observed by the 8-residue-long insertion in 



the 68-a8 loop for CelE1. These analyses suggest that 

additional residues in the 68-a8 loop might contribute to 

stabilize the structure improving the thermal tolerance of 
cellulases. 

Conclusion 

A new plant cell wall-degrading enzyme with ability to 
breakdown complex cellulose-based substrates was isolated 
and characterized from sugarcane soil metagenome. This 
enzyme was shown to be an endo-acting glucanase with high 
catalytic activity at a broad temperature range and under 
alkaline conditions that usually cause enzyme inactivation of 
classical acidic cellulases. Moreover, its crystal structure was 
determined at 1.8 A resolution indicating that the 8-residue- 
long insertion in the B8-a8 loop might confer higher 
conformational stability in comparison to its psychrophilic 
orthologs. This work contributes to both basic and applied 
knowledge of cellulases and provides a promising biocatalyst 
with unusual biochemical properties for industrial processes 
involving degradation of lignocellulosic materials. 
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Figure 5. Comparative structural analysis of CelE1 (4M1R) with other structurally similar cellulases 5. (A) Representation of 
the extended a 8 /p 8 loop conserved in thermostable enzymes (BsCel5A, Bacillus subtilis, 3PZU; BaCel5A, Bacillus agaradhaerens, 
1QHZ) in comparison to meso- and psychrophilic cellulases (EcCel5, Erwinia chrysanthemi, 1EGZ; Cel5G, Pseudoalteromonas 
haloplanktis, 1TVN). The helix a1 that makes new interactions with the extended a 8 /(3 8 loop is colored in light pink. (B) Surface 
complementarity between the extended a 8 /(3 8 loop (yellow mesh) and the neighboring structural elements (green) indicating the 
additional intramolecular contacts favored by this motif. 

doi: 10.1371/joumal.pone.0083635.g005 
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